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Figure 1: 

PREF-l 
SCS0009 



PREF-l 
SCS0009 



PREF-l 
SCS00O9 



MIATGALLRVLLLLLAFGHS TYGAECDPPCDPQYGFCEADNVCRCHVGWEGPLCDKC 

MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDPGWEGLHCERC 

* ★ ** * *★**★*. **★ **** *..* 

VTAPGCVNGVCKEPWQCICKDGWDGKFCEIDVRACTSTPCANNGTCVDLEKGQYECSCTP 

VRMPGCQHGTCHQPWQCICHSGWAGKFCDK 

* * * * _ * ik * * * ★ * * . * ★ * * * * . 



Additional exon coding for 30 aa is inserted in 
SECP59 (INCYTE WO0226982) HKAAV61 (HGS WO0134768) 
PRO40 (GENENTECH WO0224888) PREF-l (HYSEQ WO0157233) 
LP230S (LILLY WO0248361) AAG67516 (SKB WO0166690) 
35.31 (BIODOOR WO0226809) 



GFSGKDCQHKAGPCV INGSPCQHGGACVDDEGQASHASCLCPPGFSGNFCEIVAATNSCT 
GFHGRDCERKAGPCEQAGSPCRNGGQCQDDQGFALNFTCRCLVGFVGARCEVN — VDDCL 
** * ** **★*★ ****** * **.* * . .* * ** * **. * 
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25 



PREF-l 
SCS0009 



PNPCENDGVCTDIGGDFRCRCPAGFVDKTCSRPVSNCAS GPCQNGGTCLQHTQVSFECLC 
MRPC AN GAT C L DG I NRF S C LC PEGF AGRFC T I N LD DC AS RPCQRG ARCRDRVH D -FDCLC 
▲ ** * * * .**★*★* . *. . *** *** * * . . *.*** 



Initial Met in NOV8 
(CURAGEN WO0255704) 
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PREF-l 
SCS0009 



PREF-l 
SCS0009 



PREF-l 
SCS0009 



KPPFMGPTCAKKRGAS -PVQVTHLPSGYGLTYRLTPGVHELPVQQPEQHILKVSMKELN - 
PSGYGGKTCELVLPVPDPPTTVDTPLGP -TSAWVPATGPAPHSAG - AGLLRI SVKEWR 
^ * ** * * * . * * .*..*.**. 

KSTPLLTEGQAI CFTI LGVLTSLWLGTVAI VFLNKCETWV SNLRYNHTFRKKKNLLLQY 

RQE AGLGE PS LVAL WFGALTAALVLAT VLLTLR AWRRGVCPPGPCCYPAPHYAPA 

* * . . .***..****. .* 

NSGEELAVNIIFPEKIDMTTFNKEAGDEEI - 
CQDQECQVSMLPAGLPL PRDLPPEPGKTTAL 
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Figure 2: 

Clustal W alignment of predicted amino acid sequences of SCS0009 prediction and splice 
variants SV3, SV4 and SV5. 

CLUSTAL W (1.83) multiple sequence alignment 



SV4      MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDPGWEGLHCERC
SV5      MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDPGWEGLHCERC
SV3      MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDPGWEGLHCERC
SCS0009  MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDPGWEGLHCERC
         ************************************************************

SV4      VRMPGCQHGTCHQPWQCICHSGWAGKFCDKDEHICTTQSPCQNGGQCMYDGGGEYHCVCL
SV5      VRMPGCQHGTCHQPWQCICHSGWAGKFCDKDEHICTTQSPCQNGGQCMYDGGGEYHCVCL
SV3      VRMPGCQHGTCHQPWQCICHSGWA------DEHICTTQSPCQNGGQCMYDGGGEYHCVCL
SCS0009  VRMPGCQHGTCHQPWQCICHSGWAGKFCDK

SV4      PGFHGRDCERKAGP
SV5      PGFHGRDCERKAGPCEQAGSPCRNGGQCQDDQGFALNFTCRCLVGFVGARCEVNVDDCLM
SV3      PGFHGRDCERKAGPCEQAGSPCRNGGQCQDDQGFALNFTCRCLVGFVGARCEVNVDDCLM
SCS0009  -GFHGRDCERKAGPCEQAGSPCRNGGQCQDDQGFALNFTCRCLVGFVGARCEVNVDDCLM

SV5      RPCANGATCLDGINRFSCLCPEGFAGRFCTINLDDCASRPCQRGARCRDRVHDFDCLCPS
SV3      RPCANGATCLDGINRFSCLCPEGFAGRFCTINLDDCASRPCQRGARCRDRVHDFDCLCPS
SCS0009  RPCANGATCLDGINRFSCLCPEGFAGRFCTINLDDCASRPCQRGARCRDRVHDFDCLCPS

SV5      GYGGKTCELVLPVPDPPTTVDTPLGPTSAVVVPATGPAPHSAGAGLLRISVKEVVRRQEA
SV3      GYGGKTCELVLPVPDPPTTVDTPLGPTSAVVVPATGPAPHSAGAGLLRISVKEVVRRQEA
SCS0009  GYGGKTCELVLPVPDPPTTVDTPLGPTSAVVVPATGPAPHSAGAGLLRISVKEVVRRQEA
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Figure 3: 

Clustal W alignment of predicted amino acid sequences of SCS0009 with SV1 and SV2 



SCS0009 
SV1-ORF 
SV2-ORF 



M PS GCRCLH LVCLLC I LGAPG QPVRA 
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SCS0009 
SV1-ORF 
SV2-ORF 



DDCSSHCDLAHGCCAPDGSC RCDPGWEGLHCERCVRMPGCQHGTCHQPWQCICHSGWAGK 

MPGCQHGTCHQPWQCICHSGWAGK 

MPGCQHGTCHQPWQCICHSGWA — 



********************** 






SCS0009 
SVl-ORF 
SV2-ORF 



SCS0009 
SVl-ORF 
SV2-ORF 



SCS0009 
SVl-ORF 
SV2-ORF 



SCS0009 
SVl-ORF 
SV2-ORF 



SCS0009 
SVl-ORF 
SV2-ORF 



FCDK- 



GFHGRDCERKAG PCEQAGSPCRNGG 



FCDKDEHICTTQSPCQNGGQCMYDGGGEYHCVCLPGFHGRDCERKAGPCEQAGSPCRNGG 

DEHICTTQSPCQNGGQCMYDGGGEYHCVCLPGFHGRDCERKAGPCEQAGSPCRNGG 

****** ************************************************** 



qcqddqgfalnftcrclvgfvgarcevnvddclmrpcangatcldginrfsclcpegfag 
qcqddqgfalnftcrclvgfvgarcevnvddclmrpcangatcldginrfsclcpegfag 
qcqddqgfalnftcrclvgfvgarcevnvdex:lmrpcangatcldginrfsclcpegfag 
****************** ****************************************** 

rfctinlddcasrpcqrgarcrdrvhdfdclcpsgyggktcelvlpvpdppttvdtplgp 
rfctinlddcasrpcqrgarcrdrvhdfdclcpsgyggktcelvlpvpdppttvdtplgp 
rfctinlddcasrpcqrgarcrdrvhdfdclcpsgyggktce lvlpvpdppttvdtplgp 
************************************************************ 

tsavwpatgpaphsagagllri svkewrrqeaglgepslvalwfgaltaalvlatvl 
ts avwpatgpaphsagagllri svke wrrqeaglge pslvalwfgaltaalvlatvl 
tsavwpatgpaphsagagllrisvkewrrqeaglgepslvalwfgaltaalvlatvl 
************************************************************ 

ltlrawrrgvcppgpccypaphyapacqdqecqvsmlpaglplprdlppepgkttal . 
ltlrawrrgvcppgp ccypaphyapacqdqecqvsmlpaglplprdlppepgkttal . 
ltlrawrrgvcppgpccypaphyapacqdqecqvsmlpaglplprdlppepgkttal . 
********************************************************* 



XY = exon boundaries. 

In the translation, the SV1 and SV2 sequences are shown representing the longest ORF 
available. 

The predicted signal peptide of SCS0009 is shown highlighted in yellow. 
The SV1 and SV2 longest ORFs do not contain predicted signal peptides. 
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Figure 4: 



Nucleotide sequence of SCS0009 prediction with translation 

5 1 AGACGGCAAC GTGGACAGGA AGAAGCGGAG GGCGAGGAGG AGCAGAGGAG CACACAGATG 

61 AAGCAGGTGT CCACGCGTCC GGCCGTCCAT CCGTCCGTCC CTCCTGGGGC CGGCGCTGAC 
121 CATGCCCAGC GGCTGCCGCT GCCTGCATCT CGTGTGCCTG TTGTGCATTC TGGGGGCTCC 
MPS GCR CLH L V C L LCI LGA 

10 181 CGGTCAGCCT GTCCGAGCCG ATGACTGCAG CTCCCACTGT GACCTGGCCC ACGGCTGCTG 

PGQP V R A DDC SSHC DLA HGC 

241 TGCACCTGAC GGCTCCTGCA GGTGTGACCC GGGCTGGGAG GGG CTGCACT GTGAGCGCTG 
CAPD GSC RCD PGWE GLH CER 

15 

301 TGTGAGGATG CCTGGCTGCC AGCACGGTAC CTGCCACCAG CCATGGCAGT GCATCTGCCA 
CVRM PGC QHG TCHQ PWQ CIC 

361 CAGTGGCTGG GC AGGCAAGT TCTGTGACAA AGGCTTCCAT GGGCGTGACT GCGAGCGCAA 
20 HSGW AGK FCD KGFH GRD CER 

421 GGCTGGACCC TGTGAACAGG CAGGCTCCCC ATGCCGCAAT GGCGGGCAGT GCCAGGACGA 
KAGP CEQ AGS PCRN GG Q CQD 

25 481 CCAGGGCTTT GCTCTCAACT TCACGTGCCG CTGCTTGGTG GGCTTTGTGG GTGCCCGCTG 

DQGF ALN FTC RCLV GFV GAR 

541 TGAGGTAAAT GTGGATGACT GCCTGATGCG GCCTTGTGCT AACGGTGCCA CCTGCCTTGA 
CEVN VD D CLM RPCA NGA TCL 

30 

601 CGGCATAAAC CGCTTCTCCT GCCTCTGTCC TGAGGGCTTT GCTGGACGCT TCTGCACCAT 
DGIN R.FS CLC PEGF AGR FCT 

661 CAACCTGGAT GACTGTGCCA GCCGCCCATG CCAGAGAGGG GCCCGCTGTC G GGACCGTGT 
35 I N L D DCA SRP CQRG ARC RDR 

721 CCACGACTTC GACTGCCTCT GCCCCAGTGG CTATGGTGGC AAGACCTGTG AGCTTGTCTT 
VHDF DCL CPS GYGG KTC ELV 

40 781 ACCTGTCCCA GACCCCCCAA CCACAGTGGA CACCCCTCTA GGGCCCACCT CAGCTGTAGT 

LPVP DPP TTV DTPL GPT SAV 

841 GGTACCTGCC ACGGGGCCAG CCCCCCACAG CGCAGGGGCT GGTCTGCTGC GGATCTCAGT 
VVPA TGP APH SAGA GLL RI S 

45 

901 GAAGGAGGTG GTGCGGAGGC AAGAGGCTGG GCTAGGTGAG CCTAGCTTGG TGGCCCTGGT 
VKEV VRR QEA GLGE PSL V A L 

961 GGTGTTTGGG GCCCTCACTG CTGCCCTGGT TCTGGCTACT GTGTTGCTGA CCCTGAGGGC 
50 VVFG ALT AAL VLAT VLL TLR 

1021 CTGGCGCCGG GGTGTCTGCC CCCCTGGACC CTGTTGCTAC CCTGCCCCAC ACTATGCTCC 
AWRR GVC PPG P C C Y PAP H Y A 

55 1081 AGCGTGCCAG GACCAGGAGT GTCAGGTTAG CATGCTGCCA GCAGGGCTCC CCCTGCCACG 

PACQ DQE CQV SMLP AGL PLP 



60 



1141 TGACTTGCCC CCTGAGCCTG GAAAGACCAC AGCACTGTGA TGGAGGTGGG GGCTTTCTGG 
RDLP PEP GKT TAL 
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1201 CCCCCTTCCT CACCTCTTCC ACCCCTCAGA CTGGAGTGGT CCGTTCTCA C CACCCTTCAG 

1261 CTTGGGTACA CACACAGAGG AGACCTCAGC CTCACACCAG AAATATTATT TTTTTAATAC 

1321 ACAGAATGTA AGATGGAATT TTATCAAATA AAACTATGAA AATGCAAGTG GGCTCCTATG 

1381 CCAGAAAAAC CCACCTGGCG TTCCAGATGC AAGAGGGCCA GAGCAGAGGC CTGGTTCTGG 

1441 GGAAGCCTCA GGATGC TGCC CACCAAGGAG TGATTTCCAA AGAGTAATCC AGGGTGCCCT 

1501 TTTCCCTTCT GGGGAAGTGT GGAGAGGTAG AGCCCCAGAG GAGAATGTAA ACAAGCAGCC 

1561 AGCACCTCTG TATAGGCCCG GCCTGGATCA GAGAGAGGGG AGAACTCTGC AGGGTGTGGG 

1621 ATTGGGCTCA GGGACCTCCG AGTGAGGCAG GGACTCCCTG CTG 
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Figure 5: 

Nucleotide sequence with translation of cDNA insert in image clone 5478078 (SCS0009 -SV3) 

5 

1 ATGCCCAGCG GCTGCCGCTG CCTGCATCTC GTGTGCCTGT TGTGCATTCT GGGGGCTCCC 
MPS GCR CLHL VCL LCI LGAP 

61 GGTCAGCCTG TCCGAGCCGA TGACTGCAGC TCCCACTGTG ACCTGGCCCA CGGCTGCTGT 
10 GQP VRA DDCS SHC DLA HGCC 

121 GCACCTGACG GCTCCTGCAG GTGTGACCCG GGCTGGGAGG GGCTGCACTG TGAGCGCTGT 
APD GSC RCDP G WE GLH CERC 

15 181 GTGAGGATGC CTGGCTGCCA GCACGGTACC TGCCACCAGC CATGGCAGTG CATCTGCCAC 

VRM PGC QHGT CHQ PWQ CICH 



20 



241 AGTGGCTGGG CAGATGAACA TATCTGTACC ACGCAGTCCC CCTGCCAGAA TGGAGGCCAG 
SGW ADE HICT TQS PCQ NGGQ 

301 TGCATGTATG ACGGGGGCGG TGAGTACCAT TGTGTGTGCT TACCAGGCTT CCATGGGCGT 
CMY DGG GEYH CVC LPG FHGR 

361 GACTGCGAGC GCAAGGCTGG ACCCTGTGAA CAGGCAGGCT CCCCATGCCG CAATGGCGGG 
25 DCE RKA GPCE QAG SPC RNGG 

421 CAGTGCCAGG ACGACCAGGG CTTTGCTCTC AACTTCACGT GCCGCTGCTT GGTGGGCTTT 
QCQ DDQ GFAL NFT CRC LVGF 

30 481 GTGGGTGCCC GCTGTGAGGT AAATGTGGAT GACTGCCTGA TGCGGCCTTG TGCTAACGGT 

VGA RCE VNVD DCL MRP CANG 



35 



50 



541 GCCACCTGCC TTGACGGCAT AAACCGCTTC TCCTGCCTCT GTCCTGAGGG CTTTGCTGGA 
ATC LDG INRF SCL CPE GFAG 

601 CGCTTCTGCA CCATCAACCT GGATGACTGT GCCAGCCGCC CATGCCAGAG AGGGGCCCGC 
RFC TIN L D D C ASR PCQ RGAR 



661 TGTCGGGACC GTGTCCATGA CTTCGACTGC CTCTGCCCCA GTGGCTATGG TGGCAAGACT 
40 CRD RVH DFDC LCP SGY GGKT 

721 TGTGAGCTTG TCTTACCTGT CCCAGACCCC CCAACCACAG TGGACACCCC TCTAGGGCCC 
CEL VLP VPDP PTT VDT PLGP 

45 781 ACCTCAGCTG TAGTGGTACC TGCCAC GGGG CCAGCCCCCC ACAGCGCAGG GGCTGGTCTG 

TSA VVV PATG PAP HSA GAGL 



841 CTGCGGATCT CAGTGAAGGA GGTGGTGCGG AGGCAAGAGG CTGGGCTAGG TGAGCCTAGC 

LRI SVK EVVR RQE AGL GEP S 

901 TTGGTGGCCC TGGTGGTGTT TGGGGCCCTC ACTGCTGCCC TGGTTCTGGC TACTGTGTTG 
LVA LVV FGAL T A A LVL ATVL 



961 CTGACCCTGA GGGCCTGGCG CCGGGGTGTC TGCCCCCCTG GACCCTGTTG CTACCCTGCC 
55 I»TI» RAW RRGV CPP GPC CYPA 

1021 CCACACTATG CTCCAGCGTG CCAGGACCAG GAGTGTCAGG TTAGCATGCT GCCAGCAGGG 
PHY APA CQDQ ECQ VSM LPAG 



60 1081 CTCCCCCTGC CACGTGACTT GCCCCCTGAG CCTGGAAAGA CCACAGCACT G 

LPL PRD LPPE PGK TTA L 
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Figure 6 : 

Alignment of predicted amino acid sequence of SCS0009 with SCS0009-SV3 

SCS0009     MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDPGWEGLHCERC
SCS0009SV3  MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDPGWEGLHCERC
            ************************************************************

SCS0009 VRMPGCQHGTCHQPWQCICHSGWAGK ""5^55^5? 

SCS0009SV3  VRMPGCQHGTCHQPWQCICHSGWADEHICTTQSPCQNGGQCMYDGGGEYHCVCLPGFHGR

************************.: -* ***** 



SCS0009     DCERKAGPCEQAGSPCRNGGQCQDDQGFALNFTCRCLVGFVGARCEVNVDDCLMRPCANG
SCS0009SV3  DCERKAGPCEQAGSPCRNGGQCQDDQGFALNFTCRCLVGFVGARCEVNVDDCLMRPCANG
            ************************************************************

SCS0009     ATCLDGINRFSCLCPEGFAGRFCTINLDDCASRPCQRGARCRDRVHDFDCLCPSGYGGKT
SCS0009SV3  ATCLDGINRFSCLCPEGFAGRFCTINLDDCASRPCQRGARCRDRVHDFDCLCPSGYGGKT
            ************************************************************

SCS0009     CELVLPVPDPPTTVDTPLGPTSAVVVPATGPAPHSAGAGLLRISVKEVVRRQEAGLGEPS
SCS0009SV3  CELVLPVPDPPTTVDTPLGPTSAVVVPATGPAPHSAGAGLLRISVKEVVRRQEAGLGEPS
            ************************************************************

SCS0009     LVALVVFGALTAALVLATVLLTLRAWRRGVCPPGPCCYPAPHYAPACQDQECQVSMLPAG
SCS0009SV3  LVALVVFGALTAALVLATVLLTLRAWRRGVCPPGPCCYPAPHYAPACQDQECQVSMLPAG
            ************************************************************

SCS0009     LPLPRDLPPEPGKTTAL
SCS0009SV3  LPLPRDLPPEPGKTTAL
            *****************
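
The identity line printed under each alignment block can be illustrated with a minimal Python sketch. It marks with `*` every column in which all aligned residues are the same. The function name `identity_line` is hypothetical, and this is a simplification: the `:` and `.` conservation marks that Clustal W also prints are not reproduced here.

```python
def identity_line(*rows: str) -> str:
    """Return '*' for each column where every row carries the same residue."""
    if len({len(r) for r in rows}) != 1:
        raise ValueError("aligned rows must have equal length")
    marks = []
    for column in zip(*rows):
        # A column of gap characters is not counted as an identity.
        same = len(set(column)) == 1 and column[0] != "-"
        marks.append("*" if same else " ")
    return "".join(marks)


# Last block of the SCS0009 / SCS0009-SV3 alignment.
scs0009 = "LPLPRDLPPEPGKTTAL"
scs0009_sv3 = "LPLPRDLPPEPGKTTAL"
print(identity_line(scs0009, scs0009_sv3))
```

Because the two rows of the last block are identical, every column is marked.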
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Figure 7: 

Map of expression vector pEAK12d 

Molecule: pEAK12d, 8760 bps DNA Circular 

File Name: pEAK12DEST.cm5 

Description: Mammalian cell expression vector (plasmid ID 11345) 
Molecule Features: 



Type Start End Name Description 

REGION 2 595 pmb-ori 

GENE 596 1519 Amp 

REGION 1690 2795 EF-1alpha 

REGION 2703 2722 position of pEAK12F 

REGION 2796 2845 MCS 

MARKER 2855 attR1 

GENE 3256 3915 CmR 

GENE 4257 4562 ccdB 

MARKER 4603 C attR2 

REGION 4733 4733 MCS 

REGION 4734 5162 poly A/splice 

REGION 4819 4848 C position of pEAK12R 

GENE 5781 5163 C PUR PUROMYCIN 

REGION 6005 5782 C tK tK promoter 

REGION 6500 6006 C Ori P 

GENE 8552 6500 C EBNA-1 

REGION 8553 8752 sv40 
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Figure 8: 

Map of Expression vector pDEST12.2 

Molecule: pDEST 12.2, 7278 bps DNA Circular 

File Name: pDEST12-2.cm5 

Description: Eukaryotic expression vector 



Type Start End Name Description 

REGION 15 608 CMV CMV promoter 

MARKER 648 M13R M13R primer 

REGION 687 706 SP6 SP6 promoter 

REGION 730 854 attR1 

GENE 963 1622 Cm 

GENE 1964 2269 ccdB 

REGION 2310 2434 attR2 

GENE 2484 2464 C T7 T7 promoter 

MARKER 2512 C 21M13 21M13 primer 

REGION 2784 3050 pA SV40 polyadenylation signal 

REGION 3176 3631 f1 f1 intergenic region 

REGION 3791 4099 P SV40 SV40 ori & early promoter 

GENE 4158 4952 Neo 

REGION 5016 5064 pA synthetic polyadenylation 

GENE 5475 6335 Amp 

REGION 6480 7153 ori pUC ori 




WO 2004/063222 

10/540845 

PCT/EP2003/051092 

- 10/36 - 

Figure 9 

Map of pDONR221 










Molecule: pDONR221, 4759 bps DNA Circular 

File Name: pDONR221.cm5, dated 03 Jun 2003 

Description: 

Type Start End Name Description 

REGION 295 268 C rrnB T2 transcription termination sequence 

REGION 470 427 C rrnB T1 transcription termination sequence 

REGION 536 553 21M13 M13 Forward primer 

REGION 570 801 attP1 

GENE 1197 1502 ccdB 

GENE 1844 2503 Cmr Chloramphenicol resistance gene 

REGION 2751 2982 attP2 

REGION 3040 3023 C M13 Rev M13 Reverse primer 

GENE 3153 3962 Kanr 

REGION 4083 4756 pUC ori 

[Circular map of pDONR221 (4759 bps) showing the 21M13, attP1, ccdB, M13Rev and Kanr features.] 
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Figure 10 

Map of pENTR-SCS0009SV3-6HIS 



Molecule: pENTR_SCS0009-SV3-6HIS-V1, 3705 bps DNA Circular 

File Name: 14879[1].cm5 

Description: Ligation of SCS0009-SV3-6HIS into pdonr221 

Type Start End Name Description 

MARKER 536 21M13 Forward primer 

REGION 568 670 attL1 

GENE 677 1825 SCS0009-SV3-6HIS 

REGION 1829 1930 attL2 

MARKER 1986 C M13 Reverse primer 

GENE 2099 2908 Kan Kanamycin resistance gene 

REGION 3029 3702 pUC pUC origin 
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Figure 11 

Map of pEAK12d-SCS0009SV3-6HIS 

Molecule: pEAK12d-SCS0009-SV3-6HIS-V1, 8098 bps DNA Circular 

File Name: 14885[1].cm5 

Description: pEAK12 DES with two recombination sites attR1 and attR2 between 
which the cDNA is inserted 

Type Start End Name Description 

REGION 2 595 pmb-ori 

GENE 596 1519 Amp 

REGION 1690 2795 EF-1alpha 

REGION 2796 2845 MCS 

REGION 2855 2874 attB1 

GENE 2888 4039 SCS0009-SV3-6HIS 

REGION 4044 4065 attB2 

REGION 4071 4071 MCS 

REGION 4072 4500 poly A/splice 

GENE 5119 4501 C PUR PUROMYCIN 

REGION 5343 5120 C tK tK promoter 

REGION 5838 5344 C Ori P 

GENE 7890 5838 C EBNA-1 

REGION 7891 8090 sv40 

Figure 12 

Map of pDEST12.2-SCS0009SV3-6HIS 

Molecule: pDEST12.2_SCS0009-SV3-6HIS-V1, 6784 bps DNA Circular 

File Name: 14889[1].cm5 

Description: Ligation of SCS0009-SV3-6HIS-V1 into pDEST 12.2 

Type Start End Name Description 

REGION 15 608 CMV promoter 

REGION 648 665 M13R Forward primer 

REGION 687 706 SP6 promoter 

REGION 730 756 attB1 

GENE 763 1911 SCS0009-SV3-6HIS 

REGION 1940 1915 C attB2 

GENE 1990 1970 C T7 T7 promoter 

REGION 2018 2001 C 21M13 reverse primer 

REGION 2290 2556 pA SV40 polyadenylation signal 

REGION 2682 3137 f1 f1 intergenic region 

REGION 3297 3605 P SV40 
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Figure 13 

Nucleotide sequence of SCS0009 prediction with translation 

5 1 AGACGGCAAC GTGGACAGGA AGAAGCGGAG GGCGAGGAGG AGCAGAGGAG CACACAGATG 

61 AAGCAGGTGT CCACGCGTCC GGCCGTCCAT CCGTCCGTCC CTCCTGGGGC CGGCGCTGAC 
• 121 CATGCCCAGC GGCTGCCGCT GCCTGCATCT CGTGTGCCTG TTGTGCATTC TGGGGGCTCC 
MPS GCR C L H LVCL LCI LGA 

10 181 CGGTCAGCCT GTCCGAGCCG ATGACTGCAG CTCCCACTGT GACCTGGCCC ACGGCTGCTG 

PGQP V R A DDC SSHC DLA HGC 

241 TGCACCTGAC GGCTCCTGCA GGTGTGACCC GGGCTGGGAG GGGCTGCACT GTGAGCGCTG 
CAPD GSC RCD P G W E G L H CER 






301 TGTGAGGATG CCTGGCTGCC AGCACGGTAC CTGCCACCAG CCATGGCAGT GCATCTGCCA 
CVRM PGC QHG TC HQ PWQ CIC 



361 CAGTGGCTGG GCAGGCAAGT TCTGTGACAA AGGCTTCCAT GGGCGTGACT GCGAGCGCAA 
20 HSGW AGK FCD KGFH GRD CER 

421 GGCTGGACCC TGTGAACAGG CAGGCTCCCC ATGCCGCAAT GGCGGGCAGT GCCAGGACGA 
KAGP CEQ AGS PCRN GGQ CQD 

25 481 CCAGGGCTTT GCTCTCAACT TCACGTGCCG CTGCTTGGTG GGCTTTGTGG GTGCCCGCTG 

DQGF ALN FTC RCLV GFV GAR 

541 TGAGGTAAAT GTGGATGACT GCCTGATGCG GCCTTGTGC T AACGGTGCCA CCTGCCTTGA 
CEVN VDD C L M RPCA NGA TCL 

601 CGGCATAAAC CGCTTCTCCT GCCTCTGTCC TGAGGGCTTT GCTGGACGCT TCTGCACCAT 
DGIN RFS CLC PEGF AGR FCT 



661 CAACCTGG AT GACTGTGCCA GCCGCCCATG CCAGAGAGGG GCCCGCTGTC GGGACCGTGT 
35 INLD DCA SRP CQRG ARC RDR 

721 CCACGACTTC GACTGCCTCT GCCCCAGTGG CTATGGTGGC AAGACCTGTG AGCTTGTCTT 
VHDF DCL CPS GYGG KTC ELV 

40 781 ACCTGTCCCA GACCCCCCAA CCACAGTGGA CACCCCTCTA GGGCCCACCT CAGCTGTAGT 

LPVP DPP TTV DTPL GPT SAV 



841 GGTACCTGCC ACGGGGCCAG CCCCCCACAG CGCAGGGGCT GGTCTGCTGC GGATCTCAGT 
VVPA TGP APH SAGA GLL RIS 

901 GAAGGAGGTG GTGCGGAGGC AAGAGGCTGG GCTAGGTGAG CCTAGCTTGG TGGCCCTGGT 
VKEV VRR QEA GLGE PSL VAL 



961 GGTGTTTGGG GCCCTCACTG CTGCCCTGGT TCTGGCTACT GTGTTGC TGA CCCTGAGGGC 
50 VVFG ALT A A L VLAT VLL TLR 

1021 CTGGCGCCGG GGTGTCTGCC CCCCTGGACC CTGTTGCTAC CCTGCCCCAC ACTATGCTCC 
AWRR GVC PPG PCCY PAP HYA 

55 1081 AGCGTGCCAG GACCAG GAGT GTCAGGTTAG CATGCTGCCA GCAGGGCTCC CCCTGCCACG 

PACQ DQE CQV SMLP AGL PLP 

1141 TGACTTGCCC CCTGAGCCTG GAAAGACCAC AGCACTGTGA TGGAGGTGGG GGCTTTCTGG 
RDLP PEP GKT TAL 

1201 CCCCCTTCCT CACCTCTTCC ACCCCTCAGA CTGGAGTGGT CCGTTCTCAC CACCCTTCAG 
1261 CTTGGGTACA CACACAGAGG AGACCTCAGC CTCACACCAG AAATATTATT TTTTTAATAC 






1321 ACAGAATGTA AGATGGAATT TTATCAAATA AAACTATGAA AATGCAAGTG GGCTCCTATG 

1381 CCAGAAAAAC CCACCTGGCG TTCCAGATGC AAGAGGGCCA GAGCAGAGGC CTGGTTCTGG 

1441 GGAAGCCTCA GGATGCTGCC CACCAAGGAG TGATTTCCAA AGAGTAATCC AGGGTGCCCT 

1501 TTTCCCTTCT GGGGAAGTGT GGAGAGGTAG AGCCCCAGAG GAGAATGTAA ACAAGCAGCC 

1561 AGCACCTCTG TATAGGCCCG GCCTGGATCA GAGAGAGGGG AGAACTCTGC AGGGTGTGGG 

1621 ATTGGGC TCA GGGACCTCCG AGTGAGGCAG GGACTCCCTG CTG 
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Figure 14 

Nucleotide sequence with translation of cDNA insert in image clone 3349698 (SCS0009 -SV4) 

1 ATGCCCAGCG GCTGCCGCTG CCTGCATCTC GTGTGCCTGT TGTGCATTCT GGGGGCTCCC 
MPS GCR CLHL VCL LCI LGAP 

61 GGTCAGCCTG TCCGAGCCGA TGACTGCAGC TCCCACTGTG ACCTGGCCCA CGGCTGCTGT 
GQP V R A DDCS SHC DLA HGCC 

121 GCACCTGACG GCTCCTGCAG GTGTGACCCG GGCTGGGAGG GGCTGCACTG TGAGCGCTGT 
APD GSC RCDP GWE GLH CERC 



181 GTGAGGATGC CTGGCTGCCA GCACGGTACC TGCCACCAGC CATGGCAGTG CATCTGCCAC 
15 VRM PGC QHGT CHQ PWQ CICH 

241 AGTGGCTGGG CAGGCAAGTT CTGTGACAAA G ATGAACATA TCTGTACCAC GCAGTCCCCC 
SGW AGK FCDK DEH ICT TQSP 

20 301 TGCCAGAATG GAGGCCAGTG CATGTATGAC GGGGGCGGTG* AGTACCATTG TGTGTGCTTA 

CQN GGQ CMYD GGG EYH CVCL 



361 CCAGGCTTCC ATGGGCGTGA CTGCGAGCGC AAGGCTGGAC CC 
PGF HGR DCER KAG P 
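
The translation printed beneath the nucleotide rows can be reproduced with a short Python sketch, assuming the standard genetic code and reading frame 1 from the first base. The names `CODON_TABLE` and `translate` are illustrative only; the two rows used below are copied from the first and last rows of the SCS0009-SV4 insert.

```python
# Standard genetic code, bases ordered T, C, A, G (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, a trailing partial codon is dropped."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    # Raises KeyError on any character other than A, C, G or T.
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_row = "ATGCCCAGCG GCTGCCGCTG CCTGCATCTC GTGTGCCTGT TGTGCATTCT GGGGGCTCCC"
last_row = "CCAGGCTTCC ATGGGCGTGA CTGCGAGCGC AAGGCTGGAC CC"
print(translate(first_row))  # MPSGCRCLHLVCLLCILGAP
print(translate(last_row))   # PGFHGRDCERKAGP
```

Both results agree with the amino acid rows shown under positions 1 and 361 of the figure.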



25 
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Figure 15 

Alignment of predicted amino acid sequence of SCS0009 with SCS0009-SV4 



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SCS0009 
SCS0009SV4 

SCS0009 
SCS0009SV4 

SCS0009 
SCS0009SV4 

SCS0009 
SCS0009SV4 

SCS0009 
SCS0009SV4 

SCS0009 
SCS0009SV4 

SCS0009 
SCS0009SV4 

SCS0009 
SCS0009SV4 



MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDLAHGCCAPDGSCRCDP 
MPSGCRCLHLVCLLCILGAPGQPVRADDCSSHCDIAHGCCAPDGSCRCDP 



GWEGLHCERCVRMPGCQHGTCHQPWQCICHSGWAGKFCDK 

GWEGLHCERCVRMPGCQHGTCHQPWQCICHSGWAGKFCDKDEHICTTQSP 

GFHGRDCERKAGPCEQAGS PCRNGGQCQD 

CQN GGQCMYDGGGE YHC VCL PGFHGRDCERKAG P 

DQGFALNFTCRCLVGFVGARCEVNVDDCLMRPCANGATCLDGINRFSCLC 



PEGFAGRFCTINLDDCASRPCQRGARCRDRVHDFDCLCPSGYGGKTCELV 



LPVPDPPTTVDTPLGPTSAVWPATGPAPHSAGAGLLRISVKEWRRQEA 
GLGEPSLVALWFGAIiTAALVLATVLLTLRAWRRGVCPPGPCCYPAPHYA 



PACQDQECQVSMIiPAGLPLPRDLPPEPGKTTAL 
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Figure 16 

Map of expression vector pEAK12d 
Molecule: pEAK12d, 8760 bps DNA Circular 

File Name: pEAK12DEST.cm5 

Description: Mammalian cell expression vector (plasmid ID 11345) 
Molecule Features: 
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Figure 17 

Map of Expression vector pDEST12.2 
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Figure 18 

Map of pDONR 221 

Molecule: pDONR221, 4759 bps DNA Circular 

File Name: pDONR221.cm5, dated 03 Jun 2003 

Description: 
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Figure 19 

Map of pENTR-SCS0009SV4-6HIS 



Molecule: pENTR_SCS0009-SV4-6HIS-V1, 2976 bps DNA Circular 

File Name: 15055[1].cm5 

Description: Ligation of SCS0009-SV4-6HIS into pdonr221 

Type Start End Name Description 

MARKER 536 21M13 Forward primer 

REGION 568 670 attL1 

GENE 677 1096 SCS0009-SV4-6HIS 

REGION 1100 1201 attL2 

MARKER 1257 C M13 Reverse primer 

GENE 1370 2179 Kan Kanamycin resistance gene 

REGION 2300 2973 pUC pUC origin 
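
Rows of a feature table like the one above can be read programmatically. The following Python sketch is a hedged illustration only: it assumes each row is `Type Start [End] [C] label`, and it assumes that a standalone `C` marks a feature on the complementary strand, which the figure itself does not state. The names `Feature`, `parse_feature` and `feature_length` are hypothetical.

```python
from typing import NamedTuple, Optional


class Feature(NamedTuple):
    kind: str
    start: int
    end: Optional[int]   # MARKER rows list a single position only
    complement: bool     # assumption: a standalone "C" means complementary strand
    label: str


def parse_feature(row: str) -> Feature:
    """Parse one whitespace-separated row of a vector feature table."""
    tokens = row.split()
    kind, start = tokens[0], int(tokens[1])
    rest = tokens[2:]
    end = None
    if rest and rest[0].isdigit():
        end = int(rest.pop(0))
    complement = bool(rest) and rest[0] == "C"
    if complement:
        rest.pop(0)
    return Feature(kind, start, end, complement, " ".join(rest))


def feature_length(feature: Feature) -> Optional[int]:
    """Inclusive length in bp; None when the row has no end coordinate."""
    if feature.end is None:
        return None
    return abs(feature.end - feature.start) + 1


gene = parse_feature("GENE 677 1096 SCS0009-SV4-6HIS")
marker = parse_feature("MARKER 1257 C M13 Reverse primer")
print(gene.label, feature_length(gene))
print(marker.label, marker.complement, feature_length(marker))
```

For the GENE row the inclusive length is 1096 - 677 + 1 = 420 bp.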







Figure 20 

Map of pEAK12d-SCS0009SV4-6HIS 

Molecule: pEAK12d-SCS0009-SV4-6HIS-V1, 7369 bps DNA Circular 

File Name: 15061[1].cm5 

Description: pEAK12 DES with two recombination sites attR1 and attR2 between 
which the cDNA is inserted 
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Figure 21 

Map of pDEST12.2-SCS0009SV4-6HIS 

Molecule: pDEST12.2_SCS0009-SV4-6HIS-V1, 6055 bps DNA Circular 

File Name: 15063[1].cm5 

Description: Ligation of SCS0009-SV4-6HIS-V1 into pDEST 12.2 
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Figure 22 

Nucleotide sequence of SCS0009 prediction with translation 

1 agacggcaac gtggacagga agaagcggag ggcgaggagg agcagaggag cacacagatg 
5 61 aagcaggtgt ccacgcgtcc ggccgtccat ccgtccgtcc ct cctggggc cggcgctgac 

121 catgcccagc ggctgccgct gcctgcatct cgtgtgcctg ttgtgcattc tgggggctcc 
mps gcr clh lvcl lei lga 

181 cggtcagcct gtccgagccg atgactgcag ctcccactgt gacctggccc aeggctgetg 
10 pgqP vra ddc sshc dla hgc 

241 tgcacctgac ggctcctgca ggtgtgaccc gggctgggag gggctgeact gtgagcgctg 
capd gsc red pgwe glh cer 

15 301 tgtgaggatg cctggctgcc ageaeggtae ctgccaccag ccatgg cagt gcatctgcca 

cvrm pgc qhg tchq pwq cic 

361 cagtggctgg gcaggcaagt tctgtgacaa aggcttccat gggcgtgact gcgagcgcaa 
hsgw agk fed kgfh grd cer 

20 

421 ggctggaccc tgtgaacagg caggctcccc atgccgcaat ggcgggcagt gecaggaega 
kagp ceq ags pern ggq cqd 

481 ccagggcttt gctctcaact tcacgtgccg ctgcttggtg ggctttgtgg gtgcccgctg 
25 dqgfalnftcrclvgfv gar 

541 tgaggtaaat gtggatgact gectgatgeg gccttgtgct aacggtgcca cctgccttga 
cevn vdd elm rpca nga tel 

30 601 eggcataaac cgcttctcct gcctctgtcc tgagggcttt getggacget tctgcaccat 

dgin rfs clc pegf agr fct 

661 caacctggat gactgtgcca gccgcccatg ccagagaggg gcccgctgtc gggaccgtgt 
inld dca srp cqrg arc rdr 

35 

721 ccacgacttc gactgcctct gccccagtgg ctatggtggc aagacctgtg agct tgtctt 
vhdf del cps g y g g k t c elv 

781 acctgtecca gaccccccaa ccacagtgga cacccctcta gggcccacct cagctgtagt 
40 lpvp dpp ttv dtpl gpt sav 

841 ggtacctgcc aeggggecag ccc cccacag egcagggget ggtctgctgc ggatctcagt 
vvpa tgp aph saga gll r.is 

45 901 gaaggaggtg gtgeggagge aagaggctgg gctaggtgag cctagcttgg tggccctggt 

vkev vr r qea glge psl val 

961 ggtgtttggg gccctcactg ctgccctggt tctggctact gtgttgctga ccctgagggc 
vvfg alt aal vlat vll tlr 

50 

1021 ctggcgccgg ggtgtctgcc cccctggacc ctgttgctac cctgccccac actatgctcc 
awrr gvc pp g pecy pap hya 

1081 agcgtgccag gaccaggagt gtcaggttag catgctgcca gcagggctcc ccctgccacg 
55 pacq dqe cqv smlp agl pip 
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10 



1141 tgacttgccc cctgagcctg gaaagaccac agcactgtga tggaggtggg ggctttctgg 
rdlp pep gkt tal 

1201 cccccttcct cacctcttcc acccctcaga ctggagtggt ccgttctcac cacccttcag 
1261 cttgggtaca cacacagagg agacctcagc ctcacaccag aaatattatt tttttaatac 
1321 acagaatgta agatggaatt ttatcaaata aaactatgaa aatgcaagtg ggctcctatg 
1381 ccagaaaaac ccacctggcg ttccagatgc aagagggcca gagcagaggc ctggttctgg 
1441 ggaagcctca ggatgctgcc caccaaggag tgatttccaa agagtaatcc agggtgccct 
1501 tttcccttct ggggaagtgt ggagaggtag agccccagag gagaatgtaa acaagcagcc 
1561 agcacctctg tataggcccg gcctggatca gagagagggg agaactctgc agggtgtggg 
1621 attgggctca gggacctccg agtgaggcag ggactccctg ctg 
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Figure 23 

Nucleotide sequence with translation of SCS0009-SV5 PCR product indicating the positions 
of the SCS0009-AP1 , -AP2, -AP3 and -AP4 primers used to generate the SCS0009 
5 sequence. 



ftCsnnng-APi 

1 tccatccgtc cgtccctcct ggggccggcg ctgaccatgc ccagcggctg ccgctgcctg 
10 mpsgcrcl 

► 

61 catctcgtgt gcctgttgtg cattctgggg gctcccggtc agcctgtccg age cgatgac 
hlv ell cilg apg qpv radd 

15 

121 tgcagctccc actgtgacct ggcccacggc tgctgtgcac ctgacggctc ctgcaggtgt 
ess hed lahg cca pdg sere 

20 181 gacccgggct gggaggggct gcactgtgag cgctgtgtga ggatgcctgg ctgccagcac 

dpg weg lhce rev rmp gcqh 



241 ggtacctgcc accagccatg geagtgeate tgccacagtg gctgggcagg caagttctgt 
25 gtchqpwqcichsgwa g k f c 

gacaaagatg aacatatctg taccacgcag tccccctgcc agaatggagg ccagtgcatg 
dkd ehi cttq spc qng gqcm 



scsnonQ-AP3 ^ 

tatgaegggg gcggtgagta ccattgtgtg tgcttaccag gcttccatgg gcgtgactgc 
ydg gge yhev clp gfh grdc 

35 

421 gagegcaagg ctggaccctg tgaacaggca ggctccccat geegcaatgg cgggcagtgc 
erk agp ceqa gsp crn ggqc 

481 caggacgacc agggctttgc tctcaacttc acgtgccgc t gcttggtggg ctttgtgggt 
40 qddqgfalnftcrclvgfvg 

541 gcccgctgtg aggtaaatgt ggatgactgc ctgatgegge cttgtgctaa cggtgccacc 
arc evn vddc lmr pea ngat 

45 601 tgccttgacg gcataaaccg cttctcctgc ctctgtcctg agggctttgc tggaegctte 

eld gin rfsc lep egf agrf 

661 tgcaccatca acctggatga ctgtgccagc cgcccatgcc agagaggggc ccgctgtcgg 

cti nld dcas rp c qrg arcr 

50 

721 gaccgtgtcc acgacttcga ctgcctctgc cccagtggct atggtggcaa gacctgtgag 

drv hdf dele psg ygg ktce 



301 



30 



361 



781 cttgtcttac ctgtcccaga ccccccaacc acagtggaca cccctctagg gcccacctca 
55 lvlpvpdppttvdtplgpts 
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841 gctgtagtgg tacctgccac ggggccagcc ccccacagcg caggggctgg tctgctgcgg 
avv vpa tgpa phs aga g 1 1 r 

901 atctcagtga aggaggtggt gcggaggcaa gagg ctgggc taggtgagcc tagcttggtg 
5 isvkevvrrqeaglgepslv 

961 gccctggtgg tgtttggggc cctcactgct gccctggttc tggctactgt gttgctgacc 
alv vfg alta alv lat vllt 

10 1021 ctgagggcct ggcgccgggg tgtctgcccc cctggaccct gttgctaccc tgccccacac 
Ira wrr gvcp pgp ccy paph 



15 



20 



1081 tatgctccag cgtgccagga ccaggagtgt caggttagca tgctgccagc agggctcccc 
yap acq dqec qvs mlp aglp 

1141 ctgccacgtg acttgccccc tgagcctgga aagaccacag cactgtgatg gaggtgggg 
lpr dip pepg ktt al 

^ sr.Rnnnfl-AP/1 
Sequence in grey « bases not present in SCS0009 prediction 
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Figure 24 

Nucleotide sequence and translation of cloned SCS0009 ORF 

1 accatgccca gcggctgccg ctgcctgcat ctcgtgtgcc tgttgtgcat tctgggggct 
mp sgc rclh lvc 11c ilga 

5 

61 cccggtcagc ctgtccgagc cgatgactgc agctcccact gtgacctggc ccacggct gc 
pgq pvr addc ssh cdl ahgc 

121 tgtgcacctg acggctcctg caggtgtgac ccgggctggg aggggctgca ctgtgagcgc 
10 capdgscrcdpgweglhcer 

181 tgtgtgagga tgcctggctg cca gcacggt acctgccacc agccatggca gtgcatctgc 
cvr mpg cqhg tch qpw qcic 

15 241 cacagtggct gggcaggcaa gttctgtgac aaaggcttcc atgggcgtga ctgcgagcgc 

hsg wag k f c d kgf hgr dc er 

301 aaggctggac cctgtgaaca ggcaggctcc ccatgccgca atggcgggca gtgccaggac 
kag pee qags per ngg qcqd 

20 

361 gaccagggct ttgctctcaa cttcacgtgc cgctgcttgg tgggctttgt gggtgcccgc 
dqg fal n ftc rcl vgf vgar 

421 tgtgaggtaa atgtggatga ctgcctgatg cggccttgtg ctaacggtgc cacctgcctt 
25 cev nvd dclm rpc ang a tel 

481 gaeggcataa accgcttctc ctgcctctgt cctgagggct ttgctggacg ctt ctgcacc 
dgi nrf scl c peg fag rfct 

30 541 atcaacctgg atgactgtgc cagccgccca tgecagagag gggcccgctg tegggacegt 

inl ddc asrp cqr gar crdr 

601 gtccacgact tcgactgcct ctgccccagt ggctatggtg gcaagacctg tgagcttgtc 
vhd fdc leps gyg gkt celv 

35 

661 ttacctgtcc cagacccccc aaccacagtg gacacccctc tagggcccac ctcagctgta 
lpv pdp pttv dtp lgp t sav 

721 gtggtacctg ccacggggcc agccccccac agegcagggg ctggtctgct geggatctea 
40 vvp atg paph sag agl Iris 

781 gtgaaggagg tggtgcggag gcaagaggct gggctaggtg agectagett ggtggccctg 
vke vvr rqea gig eps Ival 

45 841 gtggtgtttg gggccctcac tgctgccctg gttctggcta ctgtgttgct gaccctgagg 

vvf gal taal via tvl ltlr 

901 gcctggcgcc ggggtgtctg cccccctgga ccctgttgct accctgccc c acactatget 
awr rgv eppg pec ypa phya 

50 

961 ccagcgtgcc aggaccagga gtgtcaggtt ageatgetge cagcagggct ccccctgcca 
pac qdq ecqv sml pag lplp 

55 1021 cgtgacttgc cccctgagcc tggaaagacc acagcactgt ga 
rdl ppe pgkt tal 
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Figure 25 

Map of pCR-BluntII-TOPO-SCS0009 

Molecule: pCR-BluntII-TOPO-SCS0009, 5012 bps DNA Circular 
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Figure 26 

Map of pDONR221 

Molecule: pDONR221, 4759 bps DNA Circular 

File Name: pDONR221.cm5 

Description: 
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Figure 27 

Map of expression vector pEAK12d 

Molecule: pEAK12 d, 8760 bps DNA Circular 

File Name: pEAK12DEST.cm5 

Description: Mammalian cell expression vector (plasmid ID 11345) 
Molecule Features: 
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Figure 28 

Map of Expression vector pDEST12.2 

Molecule: pDEST 12.2, 7278 bps DNA Circular 

File Name: pDEST12-2.cm5 

Description: Eukaryotic expression vector 
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Figure 29 

Map of pENTR-SCS0009-6HIS 



Molecule: pENTR_SCS0009-6HIS-V1, 3630 bps DNA Circular 

File Name: 15057[1].cm5 

Description: Ligation of SCS0009-6HIS into pdonr221 

Type Start End Name Description 

MARKER 536 21M13 Forward primer 

REGION 568 670 attL1 

GENE 677 1750 SCS0009-6HIS 

REGION 1754 1855 attL2 

MARKER 1911 C M13 Reverse primer 

GENE 2024 2833 Kan Kanamycin resistance gene 

REGION 2954 3627 pUC pUC origin 
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Figure 30 

Map of pEAK12d-SCS0009-6HIS 

Molecule: pEAK12d_SCS0009-6HIS, 8023 bps DNA Circular 

File Name: 15062[1].cm5 

Description: pEAK12 DES with two recombination sites attR1 and attR2 between 
which the cDNA is inserted 
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Figure 31 

Map of pDEST12.2-SCS0009-6HIS 

Molecule: pDEST12.2_SCS0009-6HIS-V1, 6709 bps DNA Circular 

File Name: 15064[1].cm5 

Description: Ligation of SCS0009-6HIS-V1 into pDEST 12.2 
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Figure 32 

Domains within the query sequence SCS0009 of 352 residues 

Domains within the query sequence SCS0009-SV5 of 383 residues, which is identical to 
sptremblnew|AAQ88493|AAQ88493 

Transmembrane segments as predicted by the TMHMM2 program, coiled coil regions determined 
by the Coils2 program, segments of low compositional complexity determined by the SEG 
program, signal peptides determined by the Sigcleave program, and GPI anchors are each 
indicated by their own symbol. Regions containing repeats detected by Prospero but not 
covered by domains are also indicated by a symbol. 
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